Computational Haplotype Analysis: An overview of computational methods in genetic variation study

نویسندگان

  • Phil Hyoun Lee
  • Hagit Shatkay
چکیده

One of the major interests of current genomics research is disease-gene association, that is, identifying which DNA variation or a set of DNA variations is highly associated with a specific disease. In particular, single nucleotide polymorphisms (SNPs), which are the most common form of DNA variation on the human genome, and a set of SNPs on one chromosome, referred to as a haplotype, are at the forefront of the disease-gene association studies. In general, when haplotype information is used for studying disease-gene association, it is called haplotype analysis. Numerous studies have shown that haplotype analysis can successfully identify the DNA variations relevant to several common and complex human diseases. However, despite its advantages over other approaches, the use of haplotype analysis has been limited due to the high cost and long operation time of bio-molecular methods for obtaining the haplotype information. To address this limitation, two computational procedures, namely, Haplotype Phasing and Tag SNP Selection have been incorporated in haplotype analysis, and now provide the most practical framework for conducting large-scale association studies. In this depth paper, we introduce an overview of computational haplotype analysis, survey the existing approaches for Haplotype Phasing and Tag SNP Selection, and discuss their open problems. Given the current state of the field, as presented in this survey, we plan to conduct further research in the area of Tag SNP Selection. Chapter

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

O-36: Genome Haplotyping and Detection of Meiotic Homologous Recombination Sites in Single Cells, A Generic Method for Preimplantation Genetic Diagnosis

Background: Haplotyping is invaluable not only to identify genetic variants underlying a disease or trait, but also to study evolution and population history as well as meiotic and mitotic recombination processes. Current genome-wide haplotyping methods rely on genomic DNA that is extracted from a large number of cells. Thus far random allele drop out and preferential amplification artifacts of...

متن کامل

The axisymmetric computational study of a femoral component to analysis the effect of titanium alloy and diameter variation.

This work presents a numerical approach in order to predict the influence of implant material stiffness in a femoral component design when submitted in compression. The implant success depends on the transferred load to the neighboring bone. The finite element method can be used to analysis the stress and strain distribution in the femoral component allowing to improve the implant selection. Fo...

متن کامل

Airfoil Shape Optimization with Adaptive Mutation Genetic Algorithm

An efficient method for scattering Genetic Algorithm (GA) individuals in the design space is proposed to accelerate airfoil shape optimization. The method used here is based on the variation of the mutation rate for each gene of the chromosomes by taking feedback from the current population. An adaptive method for airfoil shape parameterization is also applied and its impact on the optimum desi...

متن کامل

Machine Learning for the Identification of the Dna Variations for Diseases Diagnosis

In this paper we give an overview of a basic computational haplotype analysis, including the pairwaise association with the use of clustering, and tagged prediction (using Bayesian networks). Moreover, we present several machine learning methods in order to explore the association between human genetic variations and diseases. These methods include the clustering of SNPs based on some similarit...

متن کامل

An Evolutionary and Phylogenetic Study of the BMP15 Gene

DNA sequence data contains a wealth of biologically useful information. Recent innovations in DNA sequencing technology have greatly increased our capacity to determine massive amounts of nucleotide sequences. These sequences can be used to specify the characteristics of different regions, interpret the evolutionary relationships between categorized groups, likelihood of performing multiple com...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1985